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Abstract 


Porcine epidemic diarrhea virus (PEDV) and porcine deltacoronavirus (PDCoV) 
are epizootic swine viruses. To detect and study the evolution of PEDV and PDCoV 
in central China (Shanxi, Henan, Hubei province), 70 clinical intestinal and fecal 
samples from piglets with severe watery diarrhea during August 2015 and June 2016 
were collected, tested and analyzed. PEDV was more frequently detected by PCR 
than PDCoV. Phylogenetic analysis of S genes showed that the 10 PEDV strains from 
this study clustered into G2a (n=7) and G2b (n=3) groups. Additionally, the three G2b 
strains (PEDV S2A) contained the same specific 3nt deletion in 52 as other reference 
strains in G2b. Interestingly, complete genome analysis indicated that CH/hubei/2016 
was closer to the US INDEL strain and С2а group. CH/hubei/2016 had опе 
recombination event іп 52 gene which may have resulted from AH2012-12 (from G2b 
group) and CH-ZMDZY-11 (from G2a group). Furthermore, 10 purifying selection 
sites in S gene indicated an adaptive evolution of PEDV in central China swine herds. 
These results suggested that Pandemic G2a and G2b are predominant PEDV genotype 
circulating in central China. In addition, the deletion and recombination identified in S 
gene suggested PEDV strains of central exhibited an evolutionary variety. However, 
whether these changes affect the pathogenicity and antigenicity of wild PEDV is 
unknown and is worth for further investigation. 
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1. Introduction 

Porcine epidemic diarrhea (PED) is an acute, highly contagious swine disease 
caused by porcine epidemic diarrhea virus (PEDV), which leads to severe watery 
vomiting, diarrhea, and high mortality in newborn piglets (Pensaert and de Bouck, 
1978; Song and Park, 2012). PEDV is an enveloped single-stranded RNA 
Coronavirus that belongs to the family Coronaviridae, genus Alphacoronavirus. 
PEDV genome (28 kb in length) has seven open reading frames (ORFs) and encodes 
four structural proteins: spike (S), envelope (E), membrane (M) and nucleocapsid (N) 
as well as three non-structural proteins: replicates la, 1b and ORF3 (Song and Park, 
2012). Among these proteins, PEDV S glycoprotein shows an important role on host 
cell entry and viral virulence and could be divided structurally into the 51 and 52 
regions (Song and Park, 2012), which is also a key protein exhibiting major genetic 
variations among different stains (Sun et al., 2015). 

PEDV was first reported in UK in 1971 and then was detected in other European 
countries during the following 6 years (Chasey and Cartwright, 1978; Wood, 1977). 
Since 1980s, it has been widespread in Asia (Song and Park, 2012). A massive PED 
outbreaks were reported in China in 2010, which associated with a novel PEDV strain 
(Pandemic strains) that was genetically distant from the Prototype PEDV strain 
CV777 (Chen et al., 2010). The first case of PED (Pandemic strains) in North 
America was reported in April, 2013. In 2014, the new variants, INDEL strains, 
containing insertions and deletions in the N terminal region of the 51 gene was found 
in American (Vlasova et al, 2014). Genetic and phylogenetic analyses indicated 
PEDV strains fell clearly into two distinct groups, G1 (Prototype strain) and G2 
(Pandemic strains) based on neighbor-joining method; G2 further separated into two 
subgroups:G2a and G2b (Huang et al, 2013). 

Porcine deltacoronavirus (PDCoV) is a proposed member of family Coronaviridae, 
genus deltacoronavirus, which also lead to severe watery vomiting, diarrhea (Woo et 
al, 2012). PDCoV has been firstly identified in Hong Kong 2012, and then 
successively detected in China mainland and USA (Song et al, 2015; Woo et al., 


2012). Like PEDV S gene, the phylogenetic relationship of PDCoV is also 
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constructed in terms of S genes (Ма et al, 2015; Zhai et aL, 2016). 

In present study, a total of 70 samples from central China during 2015 to 2016 were 
simultaneously tested for PEDV and PDCoV by PCR. S genes of PEDVs and 
complete gene of PEDV from positive samples were sequenced, respectively. 
Phylogenetic, sequence, recombination and selection pressures analysis of PEDV 
were performed to construct evolution relationships between the strains from central 
China and other country whereas one PDCoV S gene was also sequenced and 
analyzed. This work may provide new insights in the epidemiology of PEDV in 
central China. 

2. Methods 
2.1. Sample collection 

A total of 70 clinical intestinal and fecal samples were collected from different 
regions of central China (Henan, Hubei, Shanxi province) between 2015 and 2016. 
These samples came from pigs with a variety of clinical signs including vomiting, 
anorexia, and severe watery diarrhea. All clinical samples were homogenized for 
RNA extraction and virus isolation. 

2.2. Total RNA extraction and RT-PCR 

Total RNA was extracted from samples using Trizol reagent (TaKaRa, Japan) 
according to the manufacturer’s instructions. Reverse transcription was performed as 
previously described (Kim et al, 2001). Briefly, using 20ul of the reaction mixture 
containing 1101 of the total RNA extraction solution, I ul 10mM OligodT primer, 4ul 
5xRTbuffer, 2ul 1OMm dNTP, 1ul (40 U) RNase inhibitor, 1ul M-MLV (100 U) 
Reverse Transcriptase. After 1h kept under 42°C, the resulting cDNA was stored at 
-20°C. All sample’s cDNA was performed by PCR method as previously described 
(Kim et al., 2001; Song et al, 2015). In this study, to amplify the S gene of PEDV and 
PDCoV primers were designed based on PEDV CV777 strain (AF353511) and 
PDCoV HKUIS strain (KJ568769) (Table 1). The reaction mixture consisted of 1ul of 
cDNA, 10ul ExTaq (TAKARA), 1ш F-primer (10 pmol), 1 ul R-primer (10 рот), and 


RNase-free water in a total volume of 20ul. And the amplification was carried out as 


follows: 95°C for 5 min, followed by 30 cycles of 95°Cfor I min, 53°С for Ішіп and 72°C 
for 1 min, and then final elongation for 10 min at 72°C The complete genome of PEDV 
was amplified as previously described (Wang et al, 2013). Products were visualized 
using 196 agarose gel under ultraviolet light. 

2.3. Cloning and sequencing of PEDV and PDCoV gene 

Three pairs of primers were used to determine the PEDV S gene or PDCoV S gene 
(Table 1). The amplicons were purified and cloned into pMD-18T, then sequenced by 
Sanger method. Each fragment was independently sequenced at least three times. The 
sequenced products were splicing to obtain the PEDV S gene. After analysis of the S 
gene, the positive samples were selected and sequenced for the complete genomes 
(Wang et al., 2013). 

2.4. Phylogenetic and sequence analysis of PEDV and PDCOV 

Reference PEDV and PDCoV strains are shown in Table 2 (supplemental material 
1). PEDVs and PDCoV isolates from this study are shown in Table 3 (supplemental 
material 2). 

The PEDV S genes, PEDV complete genomes, and PDCoV S genes were aligned 
using ClustalW in DNASTAR7.1. Phylogenetic analysis for each dataset was 
performed using neighbor-joining method with 1000 bootstrap replicates in MEGA7. 
2.5. Recombination analysis 

ClustalW program was used to align the full-length genome sequences of 
CH/hubei/2016 and reference sequence from GenBank (Table 2) (supplemental 
material 1). RDP3 recombination detection program was used to detect the potential 
recombination events. RDP, BootScan, MaxChi, Chimaera, SiScan, and 3Seq methods 
embedded in the RDP3 software package (р<0.01) were utilized (Chen et al., 2013), 
only those recombination events supported by more than 4 programs were considered 
to avoid dependence on a single methodology (Li et al, 2016). Confirmation of the 
potential recombination events was carried out by BootScan analysis and fast 
neighbor-joining tree(Sun et al., 2016). 

2.6. Selection pressures analysis of S gene 


PEDV S gene deduced amino acid sequences were aligned by ClustalW program in 
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DNASTAR7.1 software, and selection pressures for this gene at individual codon sites 
were estimated by the ratio of non-synonymous (dN) and synonymous (dS) mutations, 
which was calculated by the Single Likelihood Ancestor Counting method and Fixed 
Effects Likelihood method available at the Datamonkey online version of the Hy-Phy 
package, using a significance level of 0.1 (Mu et al., 2013). 
3. Results 
3.1. PEDV and PDCoV of positive samples among diagnostic samples 

All the samples were simultaneously tested for PEDV and PDCoV by PCR. 84.2% 
(59/70 samples) were positive for PEDV, but only 2.9% (2/70samples) samples were 
PDCoV positive, which were also PEDV positive. (These data suggested PEDV 
infection could be more commonly detected than PDCoV in central China.) These 
data suggested that PEDV could be more commonly detected than PDCoV in central 
China. 
3.2. Sequence analyses of PEDV S gene 

To reveal the characteristics of PEDV strains currently circulating in central China, 
10 PEDV positive samples (Table 3 supplemental material 2) were selected and 
sequenced in terms of S gene. Compared to the S gene of CV777 (4152), sequencing 
of the field strains revealed a range of lengths: three were 6nt longer (4158nt), seven 
were Ont longer (4161nt). The analysis of PEDV S gene showed that at N-terminal 
domain (NTD) of S gene, these 10 strains included two notable insertions at 166nt to 
177nt and 337nt to 339nt, respectively, and one deletion between 477nt to 483nt at S 
gene compared with CV777. And same deletions and insertions segments were also 
observed in the same position on global Pandemic strains as previous study indicated 
(Huang et al., 2013). Besides, a specific deletion from 3596nt to 3598nt at S2 gene 
was firstly found in three PEDV strains (CH/hubei/2015, CH/hubei/2016, 
CH/huaiyang/2015) (Fig 1), respectively, which was named as PEDV S2A. 

From nucleotide identity analysis, the nucleotide of these 3 PEDV S2Astrains from 
our study had 99.3%-99.8% identity to one another, while the identity of the 


remaining 7 strains was 98.3%-99.8% to one another. However, the identity between 


these 3 PEDV S2Astrains and the remaining 7 strains were at 97.4%-98.5%. The 10 
PEDV strains shared 96.9%-99.2% nucleotide identities with reference Pandemic 
strains, 95.2 %-96.8% with reference INDEL strains and 93.5% -94.8% with reference 
Prototype strains, respectively. Moreover, the 3 PEDV S2A\strains were closely 
related to reference Pandemic G2b strains with 97.6%-98.6% nucleotide identity. 
However, the remaining 7 strains from our study were closely related to reference 
Pandemic G2a strains with 97.6%-99.2% nucleotide identified. 

3.3. Phylogenetic analysis of PEDV S gene 

The Fig 2 phylogenetic tree was established based upon multiple sequences 
alignment of S gene according to (Huang et al., 2013). All PEDV strains fell clearly 
into two genetic groups, designated as G2 (Pandemic strains) and G1 (Prototype 
PEDV). 

Obviously, PEDV Pandemic strains (G2) consisted of two groups. In current study, 
these 3 PEDV S2Astrains and other reference PEDV strains with 3nt-deletion in 52 
gene were grouped into G2b. The remaining 7 PEDV strains in this study were 
grouped into G2a. None of the strains isolated in this study were included in Gl 
(Prototype strain) and PEDV INDEL strain. 

3.4. Sequence analyses of CH/hubei/2016 complete genome 

After culturing on vero cell, only CH/hubei/2016 was successfully isolated, and its 
genome was future amplified and sequenced (Wang et al., 2013). CH/hubei/2016 of 
complete genome had 28063nt. Like PEDV Pandemic strains, CH/hubei/2016 strain 
contained a notable U insertion at 48nt, and two notable nucleotide deletion between 
72nt and 73nt (an A deletion) and between 84nt and 85nt (a 4-nt ООСС deletion) at 
the 5'untranslated region (UTR) in comparison with the prototype CV777 strain 
(Huang et al., 2013). 

From nucleotide identity analysis, CH/hubei/2016 strain had an identity at 98.2% 
with Pandemic strains G2b, 97.9%-98.0% with Pandemic strains G2a, 97.6%-97.7% 
with US INDEL strains, and 96.4%-97.2% with Prototype strains respectively. 

3.5. Phylogenetic analysis of PEDV complete genome 


As shown in Fig 3, all PEDV strains were clearly grouped into G1 and G2 group. 
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PEDV Pandemic strains (G2) consisted of two groups. CH/hubei/2016 belonged to an 
independent subgroup which was more closely related to the US S INEDL and G2a 
strains than the reference strains with 3nt deletion in S2 gene in the other subgroup. 
3.6. Recombination analysis 

The analysis revealed that four putatively recombinant breakpoints in 
CH/hubei/2016 strain were detected. One recombination event of putative breakpoint 
was located at position 23310 to 24012nt on S2 gene, which was supported by 6 
programs (RDP, p=1.961E-6; GENECONV, p=1.400E-3; BootScan, p=7.920E-5; 
MaxChi p=1.037E-3; Chimaera, p=7.286E-5; SiScan, р-3.351Е-4; 3Seq, 
p=4.564E-2). Additionally, phylogenetic trees based on the recombination region of 
each potential recombinant event and the non-recombinant region of each potential 
recombinant event displayed potential minor parent (CH-ZMDZY- 11), potential major 
parent (AH2012-12) which had 3nt-delection in $2 and potential recombinant strain 
evolutionary relationships (Fig 4). 
3.7. Selection pressures in PEDV S gene 

A ratio of dN/dS>0 or dN/dS<0 indicates positive and purify in selection, 
respectively, and a valued dN/dS=0 indicates neutral evolution. Significant selection 
is р<0.1.Тһе analysis revealed that PEDV strains isolated from central China from 
2015 to 2016 had 7 purifying selection sites in S1; and there were 3 purifying 
selection sites in 52 (Fig 5). No significant positive selected positions were found in 5 
gene. 
3.8. Sequence and Phylogenetic analysis of PDCoV S gene 

The S of PDCoV CH/HN/2016 had 3480nt which was similar with reference strains 
(3480nt or 3483nt). The S gene homology of PDCoV CH/HN/2016 strain was 
measured by aligning with the other S genes of PDCoV strains from USA, Korea, and 
Thailand. The S gene of CH/HN/2016 had an identity 98.3%-98.9 with most of 
Chinese strains, 98.2%-98.4% with USA strains, 98.0% with the CHN-AH-2004, 97.5% 
with Asian leopard cat coronavirus and Chinese ferret badger coronavirus and 96% 
with all the stains from Thailand, respectively. 


The phylogenetic analysis show that all PDCoV strains fell into СІ except Thailand 
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strains belonged to G2 (Fig 6). The PDCoV strains of China, USA and Korea had a 
closer relationship with Asian leopard cat and Chinese ferret-badger coronavirus than 
Thailand strains. In addition, PDCoV CH/HN/2016 from our study had a closer 
relationship with other PDCoV strains of China than CH/AH/2004. Further sequence 
analysis showed CHN/AH/2004 strain has 3-nt insertions at positions 153-155 (Wang 
et al, 2016). 

4. Discussion 

Since 2012, PEDV has re-emerged and rapidly disseminated all over the 
world(Huang et al, 2013). In this study, the data indicates that PEDV was more 
frequently found than PDCoV among the tested samples. The positive rate of PEDV 
in all diarrhea samples tested was 84.2 %, which was similar with previous study (Li 
et al, 2014). But, the positive rate of PDCoV was only 2.9% and lower than PDCoV 
in the south of China (Song et al., 2015). 

From sequence and phylogenetic analyses of PEDV S gene, the 10 PEDV strains in 
our study from central China were clustered into G2a and G2b. The 3 PEDV S2A 
strains were grouped into G2b Pandemic strains, and the remaining 7 PEDV strains 
were grouped into G2a Pandemic strains (Fig 2). In G2b, PEDV S2 A strains 
(CH/hubei/2015, CH/hubei/2016, CH/huaiyang/2015) with a part of reference PEDVs 
strains possessed a same specific 3-nt deletion in S2 gene (Fig 1). Interestingly, 
phylogenetic analysis of PEDV complete genomes showed that the PEDV 52 Astrain 
(CH/hubei/2016 ) was not grouped into same subgroup with the other reference 
PEDV strains possessed S2 3nt deletion (Fig 3). This phenomenon maybe explained 
by the fact that CH/hubei/2016 may result from the recombination between 
AH2012-12 (G2b) which had 3nt-delection in S2 and CH-ZMDZY-11(G2a) (Fig 4). 
Previous study revealed that recombination events in S1 gene between different 
PEDV groups may contribute to the genetic diversity of PEDV (Jarvis et al, 2016; Li 
et al, 2016; Sun et al., 2016). Furthermore, Taiwan New PEDV variants have two 
potential recombination breakpoints in 51 and 52 (Chiou et al., 2017). All these fact 


indicted that recombination, insertion and deletion in the S gene may contribute to the 


genetic diversity of PEDV. However, because of only one complete PEDV S2 Astrain 
genomes was included into the phylogenetic analysis, further study need to explore 
the evolution position of PEDV S2Astrains. 

Previous studies demonstrated that spike gene of Coronavirus is a determinant of 
virulence, and a single amino acid substitution can influence virulence (MacNamara 
et al, 2005; Sánchez et al, 1999). According to previous study, the deletion and 
insertion in PEDV 51 gene may affect the virulence of PEDV (Lin et al., 2015). 
Therefore, the specific 3nt-deletion in PEDV S2 gene (Fig 1) firstly reported in 
current study could also affect the pathogenicity and antigenicity of wild virus. 

More effective purifying selections on RNA viruses than DNA viruses was proved 
and purifying selection is an important mechanism for adaption of RNA viruses 
(Hughes and Hughes, 2007). Our study demonstrated 10 purifying selection sites were 
in central China of PEDV S gene which indicated an adaptive evolution of PEDV in 
central China swine herd. 

In current study, the strains with specific genetic signatures-3nt deletion in S2 gene 
was firstly identified and reported (PEDV S2 A strains), and one of strain 
(CH/hubei/2016) was clustered into different groups based on PEDV S gene or 
complete genomes. This fact may result from the recombination between AH2012-12 
(G2b) which had 3nt-delection in S2 and CH-ZMDZY-11(G2a). In contrast with 
central China of PEDV strains, PDCoV had low positive rates and had a close 
relationship with other PDCoV strains of China in 2012 to 2015. Future work should 


continue to trace the variability of PEDV strains and PDCoV strains. 
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Table legends 
Table 1 Primers used in this study 
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Table 2 Reference strains used in this study (supplemental material 1) 


Table 3 The PEDV strains isolated in current study (supplemental material 2) 


Figure legends 

Fig 1 S gene nucleotide alignments of СН/һибе/2016, СН /huaiyang/2015, 
CH/hubei/2016, and two reference PEDV strains: MN/USA/2013 (Pandemic strain) 
and CV777/Belgium/1978 (Prototype strain). NTD, N-terminal domain of the spike 
gene is hypervariable region of PEDV S gene. Black rectangle indicates the position 
of 3-nt deletions sites on PEDV genome. 

Fig 2 Phylogenetic tree based on the spike gene (S) sequence of PEDV strains were 
constructed using the neighbor-joining method in MEGA7 software. Numbers above 
branches indicate bootstrap p values calculated from 1,000 bootstrap replicates; only 
values greater than 50 are shown. ‘e’indicates the 10 isolates from this study. 

Fig 3 Phylogenetic tree based on the complete genome of PEDV strains were 
constructed using the neighbor-joining method in MEGA7 software. Numbers above 
branches indicate bootstrap p values calculated from 1,000 bootstrap replicates; only 
values greater than 50 are shown. *e'indicates the one isolates from this study 

Fig 4 Detection of potential recombination events in CH hubei 2016 by RDP4. a One 
recombination breakpoints were located at 23310 to 24012 nt. The analysis was 
performed with an RDP distance model, a window size of 1000 base pairs and a step 
size of 200 base pair; b Fast neighbor-joming (NJ) tree constructed using the 
recombinant region of each potential recombinant event; c Fast neighbor-joining (NJ) 
tree constructed using the non-recombinant region of each potential recombinant 
event. 

Fig 5 Identifying signatures of selection pressures in S gene of PEDV central China 
isolated. Imaginary lines indicate sites under significant selection (р<0.1) and 
‘e’shows the dN/dS estimates by site position. The x-axis refers to the amino acid 
position of S gene. 

Fig 6 Phylogenetic tree based on the spike gene (S) sequence of PDCoV strains were 


constructed using the neighbor-joining method in MEGA 7 software. Numbers above 
13 


branches indicate bootstrap p values calculated from 1,000 bootstrap replicates; only 


values greater than 50 are shown. *e'indicates the one isolates from this study. 
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Primer name 
PEDV SI-F 
PEDV SI-R 
PEDV S2-F 
PEDV S2-R 
PEDV S3-F 
PEDV S3-R 
PDCoV S1-F 
PDCoV S1-R 
PDCoV S2-F 
PDCoV S2-R 
PDCoV S3-F 
PDCoV S3-R 


Table. 1 Primers used in this study 


Nucleotide Sequence, 57-3” 
ССТААСТТССТА GTGCGTAA 
CACAGAAAGAACTAA ACCC 
TTTGGTGGTCTTA GTA GT GCC 
ОСТСТА GAACATCCGTCTGTA 
GGGCGA GACTCAATTATCTTGC 
CTGGACA GCA TCCAAAGA СААС 
TTGGCGGAACT CACA CA CTT 
TGACCCCGATACAACCTAACA 
GTGAGCA GITTAACTACA CCACT 
TTCTCAGCATCAACAACA ССА 
AGCAGCATACTAACCACCA GA 
ACTAGGGT GA A GGGITGGA GCA 


15 


Size(bp) 
1689 


1420 


1312 


1799 


1710 


1744 


è Primer Location 


20570-22258 


22189-23608 


23564-24875 


18210-20009 


19796-21506 


21310-23054 


а In relation to the genome of PEDV CV777 strain (AF353511) or PDCoV HKUIS5(KJ568769) 


Highlights 

e PEDV (84.2%) infection could be more commonly detected than PDCoV (2.9%) 
in central China. 

€ A specific 3nt-deletion in S2 gene was firstly reported in PEDV strains of central 
China. 

ө Phylogenetic and recombinant analysis provided evidence of the relationship 


between PEDV S2A and previous PEDV stains (3-deletion in S2). 
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PEDV genome 
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CH huaiyang 2015.seq ^ CTACGTGAGCCIGGCITAGICITIGITIACGCATGAACTICAAGAT---RC 3590 
CH hubei 2015.seq CTACGIGAGCCIGGCITAGICIIGITIACGCATGAACTICAAGAT---RC 3590 
CH hubei 2016.seq CIACGIGAGCCIGGCITAGICIIGITIACGCATGAACTICAAGA]---RC 3590 
CV777 belgium 1978.seq CIACGIGAGCCIGGCITAGICIIGTITTACGCATGAACTICAAACTIATRC 3584 
MN USA 2013.seq CTACGIGAGCCIGGCITAGICIIGITTACGCATGAACTICAAAATCATRC 3593 
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Figure 5 
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